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Summary 

The causative agent of severe acute respiratory syn¬ 
drome (SARS) has been identified as a novel coronavi¬ 
rus, SARS-CoV. The main proteinase of SARS-CoV, 
3CL pro , is an attractive target for therapeutics against 
SARS owing to its fundamental role in viral replication. 
We sought to identify novel inhibitors of 3CL pro to ad¬ 
vance the development of appropriate therapies in the 
treatment of SARS. 3CL pro was cloned, expressed, and 
purified from the Tor2 isolate. A quenched fluores¬ 
cence resonance energy transfer assay was devel¬ 
oped for 3CL pro to screen the proteinase against 50,000 
drug-like small molecules on a fully automated sys¬ 
tem. The primary screen identified 572 hits; through a 
series of virtual and experimental filters, this number 
was reduced to five novel small molecules that show 
potent inhibitory activity (IC 50 = 0.5-7 |jlM) toward 
SARS-CoV 3CL pro . 

Introduction 

The first wave of Severe Acute Respiratory Syndrome 
(SARS) was successfully contained by the summer of 
2003 [1], but not until close to 8500 people worldwide 
were infected, and over 900 had died [2]. Upon the emer¬ 
gence of this syndrome, international response was 
quick to identify and characterize its causative agent as 
a novel coronavirus, SARS-CoV [3-7]. The reemergence 
of SARS in the Guangdong province of China in Decem¬ 
ber 2003 [8] and the spring of 2004 [9], while not neces¬ 
sarily indicative of another global outbreak, illustrates 
the need to continue efforts to study this virus and de¬ 
velop appropriate therapeutics for its treatment. 

SARS-CoV is an enveloped, positive-stranded RNA 
virus whose genome is predominated by two open read¬ 
ing frames that are connected by a ribosomal frameshift 
site and that encode the two replicase proteins, ppla 
and pplab [10, 11]. These polyproteins are cleaved by 
the main proteinase 3CL pro [11] (also called M pro [12,13]) 
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in the first step of the formation of the crucial replication- 
transcription complex. The activity of 3CL pro , so named 
for its similarity to 3C proteinases of Picornaviridae [14], 
is an attractive target for the development of therapeu¬ 
tics against SARS due to its fundamental role in viral 
replication. Homology modeling with other 3CL pro ’s [12, 
15], X-ray crystallography [13], and a recent mutagene¬ 
sis study [16] have identified 3CL pro as a cysteine prote¬ 
ase with a Cys-His catalytic dyad in the active site. 
Structure-based sequence alignments of ppla and 
pplab of SARS-CoV with the equivalent polyproteins 
from human coronavirus 229E (HCoV), porcine transmis¬ 
sible gastroenteritis virus (TGEV), bovine coronavirus 
(BCoV), and avian infectious bronchitis virus (IBV) have 
identified the consensus sequence for proteolysis as 
(S, T, V, P, A)-(L, I, V, F, M)-Q/(A, S, G, N, C) [12]. In vitro 
studies have confirmed the ability of 3CL pro to process 
sites of this type, and that the proteinase has the highest 
specificity for the cleavage sites flanking 3CL pro in the 
ppla/pplab polyprotein [11, 17]. 

A structural model of 3CL pro based on sequence ho¬ 
mology with TGEV [12] as well as the solved crystal 
structure [13] have both been used in a number of stud¬ 
ies to dock substrate mimics [12,13,18] and for virtual 
screening of collections of synthetic compounds, natu¬ 
ral products, and approved antiviral therapies to evalu¬ 
ate their ability to inhibit 3CL pro [15,19-22]. Compounds 
identified as potential inhibitors of 3CL pro from these 
studies include the HIV-1 protease inhibitor L-700,417 
[15], the reverse transcriptase inhibitors calanolide A 
and nevirapine [20], glycovir, an a-glucosidase inhibitor 
[20], sabadinine, a natural product [21], and the general 
antiviral ribavirin [20] . Ribavirin has been shown to ex¬ 
hibit anti-SARS-CoV activity in vitro, but at concentra¬ 
tions that are cytotoxic [23]. At the onset of the initial 
SARS outbreak, this compound was used as a first-line 
defense both as a monotherapy [24] and in combination 
with corticosteroids [25] or Kaletra (an approved HIV 
protease inhibitor) [24]; however, it is now generally be¬ 
lieved that ribavirin is not an effective treatment for SARS 
([26, 27] and references therein). At the time of writing, 
it is not known whether the other compounds identified 
through virtual means have yet been tested with 3CL pro 
or SARS-CoV to evaluate their actual inhibitory capabil¬ 
ities. 

Several in vitro screens have also been recently re¬ 
ported that have assessed the ability of existing protein¬ 
ase inhibitors to inhibit replication of SARS-CoV and, 
in some cases, 3CL pro specifically. The HIV-1 protease 
inhibitors indinavir, saquinavir, ritonavir, lopinavir, TYA5, 
TYB5, and KNI-272 were shown to be ineffective at inhib¬ 
iting replication of SARS-CoV in vitro [23, 28]. Another 
HIV-1 protease inhibitor, nelfinavir, was shown to inhibit 
SARS-CoV replication with an EC 50 of 48 nM [28], yet it 
has also been reported that complete inhibition of viral 
replication was not seen with compound concentrations 
up to 10 |xM [23]. Another in vitro screen that similarly 
tested a library of 500 protease inhibitors resulted in 
only one compound that inhibited 3CL pro ; this molecule 
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had been developed as a transition-state analog inhibi¬ 
tor for HIV-1 protease and had a K, of 0.6 |jlM with 
3CL pro [29]. 

Although a number of existing drugs that could poten¬ 
tially act as inhibitors of 3CL pro were identified through 
virtual screening, other than ribavirin, it is not known 
how active these compounds actually are toward the 
protease. Additionally, in vitro screening campaigns il¬ 
lustrate that out of over 500 existing protease inhibitors, 
only two were identified as active toward 3CL pro . These 
findings illustrate the need to look toward the develop¬ 
ment of innovative inhibitors for this protease. One such 
study has recently identified unique keto-glutamine ana¬ 
logs as inhibitors of 3CL pro with IC 50 ’s in the low micromo¬ 
lar range (J.C. Vederas, submitted). In an effort to ad¬ 
vance this initiative, we sought to identify novel small 
molecules that specifically target 3CL pro . In this paper, 
we describe the cloning, expression, and purification of 
3CL pro from the SARS-CoVTor2 isolate, and subsequent 
screening campaign against a library of 50,000 small 
molecules. We herein report the finding of five novel 
inhibitors of 3CL pro . 


Results and Discussion 

In order to screen SARS-CoV 3CL pro against a library of 
50,000 compounds, it was necessary to use an activity 
assay for the proteinase that could be adapted to an 
automated system. Ideally, such an assay would (1) work 
over a time scale of several minutes at most to facilitate 
the rapid evaluation of thousands of potential inhibitors; 
(2) be homogeneous for ease of automation; and (3) 
be sensitive to minimize background signal from the 
presence of the compounds tested. Quenched fluores¬ 
cence resonance energy transfer (FRET) assays have 
become a common tool to monitor proteinase activity 
[30] and fulfill the necessary criteria for use in an efficient 
and robust high-throughput screening (HTS) campaign. 
The basis for such an assay is the modification of a 
peptide substrate to include a fluorescent label and 
quencher on opposing sides of the proteinase cleavage 
site (Figure 1). Using this substrate, the activity of a 
proteinase can be measured directly without the need 
for additional steps to purify and/or characterize the 
products. This type of assay is also beneficial as it per¬ 
mits enzymatic reactions to be monitored in real time 
to obtain accurate reaction rates. This aspect of the 
quenched-FRET assay is particularly attractive since, in 
evaluating the activity of a proteinase in terms of a rate, 
the potential for false negatives to occur in the screen 
due to the presence of inherent fluorescence from the 
compounds being tested is reduced. 

Two fluorogenic peptide substrates were synthesized 
to develop a continuous assay of 3CL pro proteolytic activ¬ 
ity based on FRET. The peptides were designed princi¬ 
pally on the sequences flanking 3CL pro in the polyprotein 
and solubility considerations. A peptide incorporating 
the anthranilate-nitrotyrosine donor-acceptor pair (Abz- 
SVTLQSG-Tyr(N0 2 )R) (Figure 1) was over an order of 
magnitude more sensitive in FRET-based assays than 
the equivalent Edans-Dabcyl peptide (data not shown) 
and was the preferred substrate for HTS. In the analysis 
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Figure 1. The Quenched-FRET Assay Used to Monitor 3CL pro Ac¬ 
tivity 

(A) The substrate is a 9 amino acid peptide labeled with 2-aminoben- 
zoyl (Abz) and a nitro-tyrosine on either side of the scissile peptide 
bond (dashed) to act as donor and acceptor, respectively, for FRET. 
In the intact substrate, excited state energy of Abz is transferred to 
the nitro-tyrosine via FRET and fluorescence is quenched. 

(B) Once the substrate is cleaved by 3CL pro , the donor and acceptor 
are spatially separated; subsequent fluorescence from the Abz func¬ 
tionality is not quenched and can be detected. 


of kinetic data using the Abz-Tyr(N0 2 ) substrate, the 
primary data were corrected for inner filter effect [31]. 
This adjustment was essential as it was observed that 
at even relatively low concentrations of substrate (2 |xM), 
the fluorescence of the cleaved product was signifi¬ 
cantly quenched by the nitrotyrosine moiety of the intact 
substrate. 

The K m for the Abz-Tyr(N0 2 ) substrate with 3CL pro was 
820 ±130 |xM (Figure 2A). This value is comparable to 
those seen recently for nonlabeled 11 -mer peptides that 
also mimic the cleavage sites of the natural substrates 
of 3CL pro (0.286-1.94 mM) [17], implying that the labels 
incorporated for FRET were not detrimental to the asso¬ 
ciation between enzyme and substrate. The calculated 
values of the apparent k cat and k cat /K m for the labeled 
substrate were 1.01 ± 0.09 min -1 and 1.2 ± 0.2 mM -1 
min -1 respectively, and were also similar to those re- 
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Figure 2. Activity of 3CL pro with the Fluorogenic Peptide Substrate 

(A) Michaelis-Menton plot to determine K m and the observed k cat . 

(B) A plot of observed assay rates containing 100 |xM substrate and 
varying concentrations of 3CL pro demonstrates assay linearity with 
respect to proteinase concentration. 
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Figure 3. Plot of the Residual Activity of the Controls from the 
Screen 

High (Replicate 1, orange circles; Replicate 2, green diamonds) and 
low (Replicate 1, red triangles; Replicate 2, blue squares) controls 
are clustered around 100% and 0% residual activity, respectively. 
The Z' of 0.60 is reflected in the well-defined hit window. 
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ported for the 11-mer peptides (0.847-12.2 min -1 and 
0.202-10.6 mM _1 min _1 ) [17]. These kinetic parameters 
are reported as apparent values since it has been re¬ 
cently shown [17, 32] that at the concentration of 3CL pro 
used in this study (1 |jlM) much of the enzyme is thought 
to be monomeric, whereas it is the dimeric form of the 
enzyme that is believed to be catalytically competent. 

For the purposes of screening, it is desirable to use the 
lowest concentration of substrate that yields a reliable, 
reproducible signal. It was found that assays using 100 
|jlM of the fluorogenic peptide gave a strong signal that 
was linear with varying 3CL pro concentration (Figure 2B). 
This enabled us to screen at substrate concentrations 
well below K m , which is advantageous for identifying 
inhibitors that compete with substrate for the enzyme 
active site [33] and are therefore more likely to be spe¬ 
cific toward 3CL pro . 

The compound library used to screen 3CL pro contained 
50,000 small molecules with an average molecular mass 
of 325 g/mol. This particular collection was screened 
because of its high quality, diversity, drug-likeness [34], 
resupply availability, and past success with other targets 
[35]. The campaign was run in duplicate in 384-well 
microplate format on a fully automated system. 

The statistical parameter Z', which is described as a 
measure of the quality of an HTS campaign [36], is de¬ 
fined as 

“\ — Z' = @ ac + ^ (J c ) 

Im"c+ R'c-I 

where a c+ , a c _, |x c+ , and |x c _ are the standard deviations 
(a) and averages (|x) of the high ( c+ ) and low ( c _) controls. 
This value reflects both the error associated with the 
controls of a screen, as well as the size of the “hit win¬ 
dow” (Figure 3). A Z' of 0.5 or greater is indicative of a 
quality screen with a well-defined hit window [36]. The 
Z' for the primary screen of SARS-CoV 3CL pro was 0.60 
for each of the two replicates indicating that the assay 
used to detect activity of 3CL pro was robust and amena¬ 
ble to HTS. The replicate plot of residual enzyme activity 


Figure 4. Replicate Plot for the Primary Screen of SARS-CoV 3CL pro 
The outlined hit zone was defined as 50% residual enzyme activity. 

with each of the tested library compounds (Figure 4) 
also illustrates the quality of the screen in terms of the 
reproducibility between the duplicate sets of data. This 
figure also indicates that 3CL pro appears to have been 
inhibited to some degree by a large number of com¬ 
pounds. The cutoff to determine the hit threshold was 
set to 50% residual enzyme activity in order to focus 
on the most potent inhibitors. 

Five hundred and seventy-two hits (1.1 % hit rate) were 
identified from the primary screen of 3CL pro . For the 
preliminary analyses of these hits, we opted to restrict 
the number of compounds entering secondary screen¬ 
ing by a series of virtual and experimental filters. The 
hits were first classified by recursive partitioning into 
126 groups based on similarities between molecular 
structure and activity in the primary screen [37, 38]. 
Within each of these groups, the compound that strongly 
inhibited 3CL pro was chosen as a representative candi¬ 
date for further study in secondary analyses. The dose- 
response curves of the 126 representative compounds 
were determined using standard assay conditions (100 
|jlM substrate, 1 |xM 3CL pro ) and 90 nM-100 |jlM of tested 
compound. The dose-response relationship of 72 com¬ 
pounds resulted in the typical sigmoidal semilogarithmic 
curve associated with desirable inhibitors; the 54 com¬ 
pounds that did not result in this typical curve were 
filtered out. 

To eliminate nonspecific inhibitors, the effect of the 
remaining 72 compounds on 3CL pro activity in the pres¬ 
ence of bovine serum albumin (BSA) was determined. 
If the degree of inhibition of 3CL pro by a compound is 
decreased in the presence of BSA, inhibition is not likely 
to be targeted exclusively toward 3CL pro . These so- 
called “promiscuous” inhibitors likely form aggregates 
in solution that nonspecifically absorb or adsorb the 
target enzyme [39]. Only three compounds showed any 
marked reduction in enzyme inhibition in the presence 
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Table 1. Some of the Functional Groups of the 69 Filtered Hits from the Screening Campaign that Are Potentially Reactive with the 
Catalytic Thiol in the Active Site of 3CL pro 
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a R1, R2, and R3 are a variety of aromatic and aliphatic substituents. 

b The sum of the number of compounds in each group is greater than 69 as several compounds contained more than one of the listed 
functionalities. 


of BSA, with the greatest effect being a decrease of 
only 20%; these compounds were removed from the 
prospective inhibitor pool. 

The remaining 69 potential candidates were predomi¬ 
nated by compounds with at least one potentially reac¬ 
tive center (Table 1). The thiol side chain of the active 
site cysteine of 3CL pro is necessarily a good nucleophile 
and therefore has the capacity to combine with a variety 
of electrophilic functionalities to form covalent adducts. 
This reaction is the basis for several classes of existing 
peptidic inhibitors of cysteine proteases which include 
a-p-unsaturated ketones, esters, and amides [40, 41] 
(Table 1, 1-4) and nitriles [42] (Table 1, 15). In addition, 
A/-substituted maleimides (Table 1, 14) have long been 
recognized as being reactive toward thiols in general, 
and cysteine proteases in particular [43]. The two identi¬ 
fied sulfur containing compounds (Table 1, 13) are also 
capable of forming disulfide linkages with the catalytic 
thiol of 3CL pro to inactivate the enzyme [44] The re¬ 
maining carbonyl functionalities and corresponding ni¬ 
trogen- and sulfur-containing analogs listed in Table 1, 
5-12, may also have enough electrophilic character to 
promote the nucleophilic attack of the active site thiol 
of 3CL pro . Due to the potential inherent reactivity of the 69 
candidate inhibitors, we sought to identify and eliminate 
those compounds whose high reactivity rendered them 
nonspecific toward the active site of 3CL pro by including 
1,4-dithio-D,L-threitol (DTT) in the inhibition reaction. A 
reduction in the inhibition of 3CL pro by any candidate 
compound in the presence of DTT suggests that such 
a compound would probably react nonspecifically with 
any biologically available thiol functionality [40, 41]. Al¬ 


though this renders such compounds as unlikely drug 
candidates, they may be useful in studying the active 
site architecture of 3CL pro . Of the remaining 69 candidate 
inhibitors, the inhibitory action of 5 of these was not 
significantly affected by the inclusion of 1 mM DTT; 
these were selected for further characterization. 

The dose-response curves for these five compounds 
with 3CL pro yielded IC 50 values of 0.5-7 |xM (Figure 5). 
To evaluate the selectivity of each compound, we inves¬ 
tigated their ability to inhibit four other proteinases with 
varying structural and mechanistic relatedness to SARS- 
CoV 3CL pro . The Hepatitis A virus (HAV) 3C pro , like all 
picornaviral 3C pro s, is similar in structure, mechanism, 
and substrate specificity to 3CL pro [11,14,45]. The Hepa¬ 
titis C Nonstructural 3 proteinase (NS3 pro ) and chymo- 
trypsin are serine proteases with the same two (3-barrel 
fold as 3CL pro [13, 46]. Finally, papain is a paradigm 
cysteine proteinase with an active site Cys/His/Asp ca¬ 
talytic triad whose structural fold does not resemble 
that shared by chymotrypsin, NS3 pro , 3CL pro , and 3C pro . 
It was difficult in some cases to obtain a full dose- 
response curve for every inhibitor with each of the pro¬ 
teinases due to interfering compound fluorescence and/ 
or compound insolubility under each of the assay condi¬ 
tions. In such cases, if the lack of data at higher com¬ 
pound concentrations precluded interpolation of a reli¬ 
able IC 50 value, the IC 50 is reported as a lower limit (Table 
2). Alternatively, those compounds for which enzyme 
inhibition was not seen at the highest concentration 
tested, are indicated accordingly. It is important to note 
that in the absence of full mechanistic analyses, the IC 50 
values are intended to reveal profound differences in 
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Figure 5. Dose-Response Curves for the Five Secondary Hits from 
the Screening Campaign 

IC 50 values were extracted from the curves using nonlinear regres¬ 
sion analysis (see Experimental Procedures). IC 50 values for the five 
secondary hits were 0.5 ± 0.3 |xM (MAC-5576), 4.3 ± 0.5 |xM (MAC- 
8120), 7 ± 2 |jlM (MAC-13985), 2.6 ± 0.4 |xM (MAC-22272), and 7 ± 
3 |xM (MAC-30731). The corresponding structure of each compound 
is shown on each dose-response curve. 

potency to provide insight into the selectivity of the 
inhibitors, and to aid in their prioritization as leads for 
continuing studies. 

Compound MAC-22272 showed the least selectivity 
with respect to 3CL pro (/C 50 = 2.6 |jlM), as it was the only 
inhibitor capable of inactivating all of the proteinases 
studied, although the response with NS3 pro [IC 50 > 500 
|jlM) was significantly lower than with the other enzymes. 
MAC-5576 showed some selectivity toward those pro¬ 
teinases with a chymotrypsin-like fold, as it was seen 
to inhibit 3CL pro (IC 50 = 0.5 |xM), HAV 3C pro (IC 50 = 0.5 
|jlM), and chymotrypsin (IC 50 = 13 |jlM), although no effect 
was seen with NS3 pro , which also shares this structural 
motif. Papain, which lacks this fold, was unaffected by 
MAC-5576. An important difference between 3CL pro and 
3C pro is that the coronaviral enzyme is only functionally 
active as a dimer whereas 3C pro is active as a monomer 
in vitro. The interaction between the 3CL pro subunits is 
sufficiently weak that disrupting the dimerization inter¬ 
face has been proposed as a promising target for novel 
antiviral agents [32]. In view of the relatedness between 
3CL pro and HAV 3C pro , it is therefore likely that of the five 
secondary hits, those that are also active toward HAV 
3C pro do not interact with 3CL pro at its dimer interface, 
but at a site that is common to both proteinases. 


Considering the mechanistic and structural relation¬ 
ships between 3CL pro and HAV 3C pro , and to a lesser 
extent NS3 pro , we were interested to learn that some of 
the compounds showed highest activity against these 
related enzymes. In addition to inhibiting 3CL pro with an 
IC 50 of 7 |xM, compound MAC-30731 was observed to 
have relatively potent activity against HAV 3C pro [IC 50 = 
54 |jlM) and NS3 pro (IC 50 = 71 |jlM), with only weak impact 
on chymotrypsin [IC 50 > 800 |jlM). 

MAC-8120 and MAC-13985 also demonstrated inter¬ 
esting selectivity profiles with good activity against 
SARS-CoV 3CL pro (/C 50 values were 4.3 and 7 puM, respec¬ 
tively) and no detectable inhibition of any of the other 
proteinases tested. These particular compounds, how¬ 
ever, proved to be difficult in the assays of HAV 3C pro 
and NS3 pro due to background fluorescence, and could 
not be tested at concentrations greater than 5 |jlM. Nev¬ 
ertheless, no detectable inhibition was seen at that con¬ 
centration, indicating that an IC 50 for these molecules 
would necessarily be much higher. 

The protease selectivity data, therefore, point to two 
molecules with apparent selectivity for 3CL pro , MAC- 
8120 and MAC-13985. These may well be the most 
promising leads for further mechanistic characterization 
and optimization efforts toward a protease-based antivi¬ 
ral for SARS-CoV. One molecule, MAC-30371, has an 
interesting selectivity for the picornaviral-like protein¬ 
ases and could prove to be a viable lead for this family 
of proteolytic enzymes. 

A search of the literature for any studies involving the 
five secondary hits showed that at present, none have 
been identified as proteinase inhibitors. Of particular 
significance, however, is the finding that MAC-8120 has 
been identified as an inhibitor of triosephosphate isom- 
erase from Trypanosoma cruzi, T. brucei, human, and 
yeast [IC 50 = 56 |jlM to 4 mM) [47]. This study examined 
the inhibition of the isomerase by a series of substituted 
benzothiazoles, and indicated that MAC-8120 inter¬ 
acted with cysteine residues on the enzyme that were 
previously shown to be central in dimerization. This in¬ 
teraction, although as yet uncharacterized, was reported 
to perturb the contact interface of the subunits of the 
homodimer to inactivate the enzyme. It is therefore quite 
possible that MAC-8120 associates with the active site 
cysteine of 3CL pro in a similar manner to disrupt its cata¬ 
lytic activity. The exact mechanism of inhibition of 3CL pro 
by MAC-8120 and the remaining secondary hits requires 
further study and characterization in order to evaluate 
their potential to act as leads for the development of 
novel therapeutics against SARS-CoV, and potentially, 
other picornaviral-like proteinases. Additionally, the 
core structure of these compounds can be chemically 
modified to improve key physical properties such as 
potency and solubility. 

It is clearly evident that the 3CL pro screening campaign 
has generated a wealth of data that can be mined for a 
number of studies. While it was our intention to identify 
novel small molecules that would lead to the develop¬ 
ment of therapeutics effective against SARS-CoV, the 
results generated by the screening campaign can be 
applied to other types of studies of 3CL pro and related 
proteinases including the picornaviral 3C pro . To this end, 
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Table 2. Summary of IC 50 Values of the Five Secondary Hits with SARS-CoV 3CL pro and Four Other Proteases 




ic 50 (^M) a 





Compound 

Structure 

3CL pro 

HAV 3C pro 

NS3 pro 

Chym b 

Papain 








MAC-5576 

°TY 

N 

0.5 ± 0.3 

0.54 ± 0.09 t 

N.l. (500) c 

13 ± 5 

N.l. (500) 

MAC-8120 

XXyo-^ 

0=S-OH 

|| 

4.3 ± 0.5 

N.l. (5) 

N.l. (5) 

N.l. (500) 

N.l. (500) 


o 






MAC-13985 

jT 

F F 

7 ± 2 

N.l. (5) 

N.l. (5) 

N.l. (500) 

N.l. (500) 

MAC-22272 

^ jjCvnh, 

a 

2.6 ± 0.4 

0.9 ± 0.1 

>500 d 

29 ± 12 

18 ± 9 

MAC-30731 


7 ± 3 

54 ± 16 

71 ± 14 

>800 

N.l. (500) 


N^S O 







a To calculate IC 50 values, dose-response data were fit to a 3-parameter equation (Equation 2), excluding f, in which a 4-parameter equation 
(Equation 3) was used (see Experimental Procedures). 
b Bovine pancreas chymotrypsin. 

c No inhibition seen at the concentrations of compound tested; highest concentration tested in brackets (in |xM). 
d Protease inhibition was seen, but insufficient data were obtained to calculate a reliable IC 50 value. 


we have made the complete primary screening data set 
and corresponding compound structures available for 
download from the McMaster HTS Lab (http://hts. 
mcmaster.ca/sars). Interested parties are invited to re¬ 
view the data and mine it for their own studies. 

Significance 

In response to the initial SARS outbreak of 2003, a 
concerted effort by the international scientific commu¬ 
nity identified the causative agent of this infection as 
a novel coronavirus, SARS-CoV. Characterization of 
this virus and the development of appropriate thera¬ 
pies soon followed and remains a continuing global 
effort. The main proteinase of SARS-CoV, 3CL pro , plays 
a crucial role in viral replication and is therefore an 
excellent target for therapeutics effective in treating 
SARS. Our contribution to the anti-SARS initiative was 
to employ a high-throughput screening strategy to 
identify small molecules that specifically target 3CL pro 
in order to generate new therapeutic leads. HTS is an 
efficient and rapid means of assessing the effect of a 
large number of compounds on a target of interest; 
we feel that such campaigns should play an integral 
part of any initial investigations undertaken to combat 
newly emerging infectious agents. 

A quenched-FRET assay was developed and used 
to evaluate the activity of 3CL pro in the presence of 
50,000 small drug-like molecules; of these com¬ 
pounds, 572 were shown to inhibit 3CL pro . Utilizing a 
series of filters, this number of compounds was re¬ 
duced to five potent hits (/C 50 = 0.5-7 |jlM) that repre¬ 
sent novel leads. Secondary studies of these leads will 


investigate their mode of inhibition of 3CL pro and their 
potential to provide a basis for innovative antiviral ther¬ 
apies. 

By virtue of the number of molecules evaluated, this 
campaign has resulted in an extensive data set. As 
well as identifying inhibitors of 3CL pro , the primary data 
from the screen can be utilized in numerous studies, 
such as identifying structure-activity relationships, by 
others investigating 3CL pro in particular and other main 
proteinases in general. For this reason, the authors 
have made the primary screening data set publicly 
available in the hope that researchers will find this 
information useful in future studies. 

Experimental Procedures 

Chemicals and Reagents 

All chemicals were of analytical grade and used without further 
purification. Restriction enzymes were purchased from New En¬ 
gland Biolabs (Pickering, ON, Canada). All peptide substrates and 
activating peptides were synthesized by the Peptide Synthesis Lab¬ 
oratory, NAPS Unit, University of British Columbia (Vancouver, BC, 
Canada). 

Strains, Media, and Growth 

DNA was propagated using Escherichia coli DH5a. The protease 
was expressed using E. coli GJ1158 [48] transformed with pT7-7 
derivatives [49]. E. coli strains were cultured at 37°C and 250 rpm 
in Luria-Bertani (LB) broth supplemented with the appropriate antibi¬ 
otics. For expression, E. coli GJ1158 was cultured in LB broth with 
100 |xg/mL ampicillin. 

Construction of Plasmids and Overexpression 
DNA was manipulated using standard protocols [50]. A vector was 
designed to express 3CL pro as an N-terminal, cleavable his-tagged 
protein. The expression vector was derived from pT7HP20, a deriva- 
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five of pT7-7 [49] containing an approximately 300 bp Xbal/Hindlll 
fragment from pLEHP20 [51]. A plasmid containing the coding se¬ 
quence of the SARS-CoV 3CL pro was a generous gift from Genome 
Sciences Centre (Vancouver, BC, Canada). The gene encoding 
3CL pro was amplified by PCR using the following oligonucleotides: 
SPH (5'-CCCA GCTAGC GGTTTTAGGAAAATGGCATTC-3) and 3SP 
(5-TACG AAGCTT GTCGACTTATTGGAAGGTAACACCAGAGCA-3). 
SPH introduces an Nhel site (underlined) at the beginning of the 
gene and 3SP introduces a Hindlll site (underlined) and a stop codon 
(boldface) at the end of the coding sequence. The PCR reaction 
was performed using the Expand High Fidelity DNA polymerase 
system (Roche Applied Sciences, Laval, PQ, Canada) according to 
the manufacturer’s instructions using an annealing temperature of 
50°C. The resulting amplicon was digested with Nhel and Hindlll 
and cloned in pT7HP20 yielding the expression plasmid pT7HSP1. 
The cloned gene was sequenced using an ABI 373 Strech (Applied 
Biosystems, Foster City, CA) using Big-Dye terminators to verify its 
authenticity. The amplified gene and the original clone had a deletion 
(corresponding to guanine at position 32 of the coding sequence). 
The mutation was repaired in pT7HSP1 using a convenient Bsml 
site. Accordingly, the oligonucleotide S3CDM (5 -GAAAAT GGCATT 
CCCGTCAGGCAAAGTTGAAGG-3) contains the Bsml restriction 
site (underlined) and the missing nucleotide (boldface). A second 
PCR reaction was performed using S3CDM and 3SP and Pwo DNA 
polymerase (Roche Applied Sciences) according to the manufactur¬ 
er’s conditions, using an annealing temperature of 50°C. The re¬ 
sulting amplicon was digested using Bsml and Hindlll, and was used 
to replace the corresponding fragment in pT7HSP1. This yielded 
pT7HSP2, whose nucleotide sequence was confirmed as described 
above. 

The his-tagged 3CL pro (ht-3CL pro ) was expressed in E. coli GJ1158 
freshly transformed with pT7HSP2. The cultures were grown to an 
OD 600 of ^0.5 in LB with low salt before inducing the expression of 
the protein by addition of 0.3 M NaCI. The cultures were incubated 
for an additional 12 hr before harvesting. 

Protein Purification 

All buffers were prepared using water purified on a Barnstead 
NANOpure UV apparatus to a resistivity of greater than 17 Mflcm. 
Unless otherwise specified, all manipulations involving the protein 
were performed at 4°C. Cells from 4 liter of culture were harvested 
by centrifugation. The cell pellet (approximately 10 g) was washed 
twice with 500 ml of 20 mM sodium phosphate (pH 8.0), 10% glycerol 
and then was resuspended in 10 ml of the same buffer, containing 

1 mM MgCI 2 , 1 mM CaCI 2 , and 0.1 mg/ml DNase I. The cells were 
disrupted by three successive passages through a French Press 
(Spectronic Instruments Inc., Rochester, NY) using an operating 
pressure of 20,000 p.s.i. The cell debris was removed by ultracentrif¬ 
ugation at 37,000 rpm for 90 min in a T1250 rotor (DuPont Instru¬ 
ments, Wilmington, DE). The clear supernatant fluid (raw extract) 
was decanted. 

The raw extract (approximately 20 ml) was loaded onto a glass 
Econo-Column column (Bio-Rad, Hercules, CA) containing 5 ml of 
Ni-NTA agarose resin (Qiagen Inc., Mississauga, ON, Canada) pre¬ 
equilibrated with 20 mM sodium phosphate (pH 8.0) and 10% glyc¬ 
erol. The column was then washed by the successive passage of 
4 column volumes (CV) each of 20 mM sodium phosphate (pH 8.0); 
20 mM sodium phosphate (pH 6.3) and 300 mM NaCI; and 20 mM 
sodium phosphate (pH 8.0), 20 mM imidazole, and 300 mM NaCI. 
The column was then washed with 2 CV of 20 mM sodium phosphate 
(pH 8.0) and 20 mM imidazole. The proteinase was eluted using 

2 CV of 20 mM sodium phosphate (pH 8.0) and 150 mM imidazole. 
The eluted protein was exchanged into 50 mM Tris-HCI (pH 8.0) by 
performing three rounds of concentration-dilution using a stirred cell 
concentrator equipped with a YM10 membrane (Amicon, Etobicoke, 
ON, Canada). The protein solution was concentrated to 16.8 mg/mL 
and frozen as beads in liquid nitrogen. A yield of 185 mg of ht-3CL pro 
was obtained from 4 liter of cell culture. Removal of the his-tag did 
not significantly affect 3CL pro cleavage activity. 

SARS-CoV 3CL pro Kinetic Assay 

SARS-CoV 3CL pro activity was measured by a quenched fluores¬ 
cence resonance energy transfer (FRET) assay with the peptide 


substrate 2-aminobenzoyl-SVTLQSG-Tyr(N0 2 )-R. The rate of en¬ 
zyme activity was determined by the increase in fluorescence upon 
continuous monitoring of reactions in 384-well black microplates 
(Nalge Nunc International, Rochester, NY) (excitation wavelength = 
330 nm, emission wavelength = 420 nm). Total assay volume was 
50 |xL, and contained 1 |xM 3CL pro , 100 |xM peptide substrate, 4.2% 
dimethyl sulfoxide (DMSO) (v/v) (Sigma, Oakville, ON, Canada), and 
100 mM potassium phosphate (pH 8.0) (Sigma), unless noted other¬ 
wise. Reactions were done at room temperature, and were initiated 
by the addition of substrate. Steady-state kinetic parameters were 
evaluated using 16 |xM-1 mM substrate and 1 |jlM 3CL pro ; these data 
were corrected for inner filter effect by the method of Liu et al. [31] 
with the use of Abz-SVRLQ to calculate correction factors. 

Primary Screening 

The screen of 3CL pro against 50,000 small molecules was fully auto¬ 
mated with the use of a SAGIAN Core System (Beckman Coulter, 
Inc., Fullerton, CA) equipped with an ORCA arm for labware trans¬ 
portation, a Biomek FX with a 96-channel head for liquid handling, 
and an Analyst HT (Molecular Devices Corp., Sunnyvale, CA) for 
fluorescence detection; the entire system was integrated through 
SAMI (v. 3.5, Beckman Coulter). Reactions were done in duplicate, 
and contained 10 |xM library compounds sourced from Maybridge 
pic (Cornwall, UK), and peptide and 3CL pro as above. Wells con¬ 
taining high and low controls, where neat DMSO was added instead 
of library compounds, were included on each microplate; buffer was 
added to the reaction instead of 3CL pro for low controls. Primary 
hits were defined as those compounds that reduced the activity of 
3CL pro to half of the average residual activity of the high controls. 
Activity Base (v. 5.0.5, ID Business Solutions Limited, Emeryville, 
CA), SARgen (v. 1.0, ID Business Solutions Limited), and Spotfire 
DecisionSite (v. 7.1.1, Spotfire Inc., Somerville, MA) were used for 
data analysis. Compounds identified as primary hits were grouped 
by functionality using ChemTree (v. 3.2.1, Golden Helix, Bozeman, 
MT); a representative compound from each cluster was selected 
for secondary screening. 

Secondary Screening 

The dose-response curves of the representative hits from primary 
screening were determined by 3CL pro activity assays (as described 
above), in the presence of seven concentrations of inhibitor (0.1 -100 
|xM) and 100 |xM substrate. Data was fit to 


using GraFit (v. 4.0.10, Erithacus Software Ltd., Surrey, UK) (where 
v is the background corrected reaction rate, a is the reaction rate 
in the absence of inhibitor, [/] is the concentration of inhibitor, and 
s is the slope factor) to calculate IC 50 values. Compounds exhibiting 
sigmoidal semilogarithmic dose-response curves were tested by the 
described 3CL pro activity assay (with 31.2 |xM of tested compound) in 
the presence and absence of 0.01 % (w/v) bovine serum albumin 
(BSA) (Sigma) and 1 mM 1,4-dithio-D,L-threitol (DTT) (BioShop Can¬ 
ada Inc., Burlington, ON, Canada) in two separate studies. Those 
compounds whose inhibition of 3CL pro was not affected by the pres¬ 
ence of BSA or DTT were classified as secondary hits. 

Evaluation of Hit Specificity 

The dose-response of the secondary hits was determined with pa¬ 
pain (Calbiochem, La Jolla, CA), bovine pancreas chymotrypsin (Cal- 
biochem), Hepatitis A 3C proteinase (HAV 3C pro ) (purified according 
to [52]) and the Hepatitis C Nonstructural 3 proteinase (NS3 pro ) (181 
amino-terminal portion of the enzyme purified according to [46]) in 
the presence of five to eleven concentrations of each inactivator (5 
nM-500 |xM). Activity assays for papain (5 nM) and chymotrypsin 
(200 pM) were done at room temperature in a total volume of 50 |xl 
in black 384-well microplates (Corning, Corning, NY). Activity assays 
for HAV 3C pro (0.5 |xM) and NS3 pro (0.23 |xM) were done in black 
96-well microplates in a total volume of 200 |xl at 37°C and 30°C, 
respectively. The substrate for papain and chymotrypsin was a ca¬ 
sein derivative extensively labeled with the green-fluorescent BOD- 
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IPY FL dye (Molecular Probes, Eugene, OR) (50 and 10 nM substrate 
for papain and chymotrypsin assays, respectively). Substrates for 
HAV 3C pro and NS3 pro were 10 |xM Dabcyl-GLRTQSND(Edans)G and 
100 |xM Abz-DDIVPCSMSY(N0 2 )T, respectively. Enzyme activity for 
all proteases was determined by the increase in fluorescence upon 
continuous monitoring of reactions using either the Analyst HT (Mo¬ 
lecular Devices Corp.) or Victor 2 (PerkinElmer, Woodbridge, ON, 
Canada) fluorescence plate readers. Excitation and emission wave¬ 
lengths, respectively, for each proteinase assay were 485 and 530 
nm (papain and chymotrypsin), 355 and 460 nm (HAV 3C pro ), and 
340 and 430 nm (NS3 pro ). Buffer systems for each protease were as 
follows: papain, 10 mM MES (pH 6.2), 200 mM NaCI, 2 mM EDTA, 
and 1 mM DTT; chymotrypsin, 50 mM Tris/HCI (pH 7.8); HAV 3C pro , 
100 mM potassium phosphate (pH 7.5), and 2 mM EDTA; NS3 pro , 50 
mM HEPES (pH 7.3), 150 mM NaCI, 0.1% Triton X-100, and 1 mM 
DTT. For the NS3 pro activity assay, the enzyme was incubated with 
15 |xM 4A activating peptide (acetyl-KKKGSVVIVGRIILSGR-NH 2 ) at 
30°C for 15 min in reaction buffer, then incubated with inactivator 
for an additional 15 min before initiation of the reaction with sub¬ 
strate. Dose-response data for these proteases with the secondary 
hits were fit to Equation 2 above, excluding HAV 3C pro with MAC- 
5576, which was fit to 


(where c is the calculated background) since enzyme activity at the 
highest concentration of inhibitor did not fall to zero. 

Mass Spectrometry 

The mass of each secondary hit was confirmed by the McMaster 
Regional Centre for Mass Spectrometry (Hamilton, ON, Canada) 
using a Quattro Ultima triple quadrupole mass spectrometer (Wa¬ 
ters/Micromass, Manchester, UK) using an electrospray ionization 
source. 
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